Solution of the linear quadratic regulator problem of black box linear systems using reinforcement learning
نویسندگان
چکیده
In this paper, a Q-learning algorithm is proposed to solve the linear quadratic regulator problem of black box systems. The only has access input and output measurements. A Luenberger observer parametrization constructed using control new obtained from factorization utility function. An integral reinforcement learning approach used develop approximator structure. gradient descent update rule estimate on-line parameters Q-function. Stability convergence under assessed Lyapunov stability theory. Simulation studies are carried out verify approach.
منابع مشابه
on the effect of linear & non-linear texts on students comprehension and recalling
چکیده ندارد.
15 صفحه اولLinear Quadratic Regulation using Reinforcement
In this paper we describe a possible way to make reinforcement learning more applicable in the context of industrial manufacturing processes. We achieve this by formulating the optimization task in the linear quadratic regulation framework, for which a conventional control theoretic solution exist. By rewriting the Q-learning approach into a linear least squares approximation problem, we can ma...
متن کاملEfficient Reinforcement Learning for High Dimensional Linear Quadratic Systems
We study the problem of adaptive control of a high dimensional linear quadratic (LQ) system. Previous work established the asymptotic convergence to an optimal controller for various adaptive control schemes. More recently, for the average cost LQ problem, a regret bound of O( √ T ) was shown, apart form logarithmic factors. However, this bound scales exponentially with p, the dimension of the ...
متن کاملNumerical solution of linear control systems using interpolation scaling functions
The current paper proposes a technique for the numerical solution of linear control systems.The method is based on Galerkin method, which uses the interpolating scaling functions. For a highly accurate connection between functions and their derivatives, an operational matrix for the derivatives is established to reduce the problem to a set of algebraic equations. Several test problems are given...
متن کاملReinforcement Learning Applied to Linear Quadratic Regulation
Recent research on reinforcement learning has focused on algorithms based on the principles of Dynamic Programming (DP). One of the most promising areas of application for these algorithms is the control of dynamical systems, and some impressive results have been achieved. However, there are significant gaps between practice and theory. In particular, there are no con vergence proofs for proble...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information Sciences
سال: 2022
ISSN: ['0020-0255', '1872-6291']
DOI: https://doi.org/10.1016/j.ins.2022.03.004